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1. Introduction 


The most recent version of the GEOS-5 Atmospheric General Circulation Model (GEOS-5 AGCM) 
uses the non-hydrostatic finite-volume dynamics (an extension of Lin, 2004) integrated with various 
physics packages (e.g., Bacmeister et al., 2006), under the Earth System Modeling Framework 
(ESMF, Hill et al., 2004) including the Catchment Land Surface Model (CLSM) (e.g., Koster et al., 
2000). The GEOS-5 AGCM is documented in Rienecker et al. (2008) with more recent updates 
described in Molod et al. (201 1). 

This document describes the specific GEOS-5 model configuration used to perform a two-year global, 
non-hydrostatic mesoscale simulation for the period 2005-2007 at 7-km horizontal resolution. 

Because this simulation is intended to serve as a reference Nature Run for Observing System 
Simulation Experiments (OSSEs, e.g., Errico et al., 2012) it will be referred to as the 7-km GEOS-5 
Nature Run or 7-km G5NR. This simulation has been performed with the Ganymed version of GEOS- 
5, more specifically with CVS Tag wmp-Ganymed-4_0_BETA8 . 

In addition to standard meteorological parameters (wind, temperature, moisture, surface pressure), 
this simulation includes 15 aerosol tracers (dust, sea-salt, sulfate, black and organic carbon), O 3 , CO 
and CO 2 . This model simulation is driven by prescribed sea-surface temperature and sea-ice, as well 
as surface emissions and uptake of aerosols and trace gases, including daily volcanic and biomass 
burning emissions, biogenic sources and sinks of CO 2 , and high-resolution inventories of 
anthropogenic sources. 

The simulation is performed at a horizontal resolution of 7 km using a cubed-sphere horizontal grid 
with 72 vertical levels, extending up to 0.01 hPa (~ 85 km). For user convenience, all data products 
are generated on two logically rectangular longitude-latitude grids: a full-resolution 0.0625° grid that 
approximately matches the native cubed-sphere resolution, and another 0.5° reduced-resolution grid. 
The majority of the full-resolution data products are instantaneous with some fields being time- 
averaged. The reduced-resolution datasets are mostly time-averaged, with some fields being 
instantaneous. Hourly data intervals are used for the reduced-resolution datasets, while 30-minute 
intervals are used for the full-resolution products. All full-resolution output is on the model’s native 
72-layer vertical grid, while the reduced-resolution output is given on both the native vertical levels 
and on 42 pressure surfaces extending up to 0.1 hPa. GMAO Office Note 6 (da Silva et al., 2014) 
presents additional details on horizontal and vertical grids. 

The GEOS-5 Nature Run data products are organized into file collections that are described in detail 
in da Silva and Putman (2014). Additional details about variables listed in this file specification can 
be found in a separate document, the GEOS-5 File Specification Variable Definition Glossary. 
Documentation about the current access methods for products described in this document can be 
found on the GMAO products page: http ://gmao . gsfc .nasa. go v/products/ . 

This document is organized as follows. Section 2 gives an overview of the ESMF component-based 
architecture adopted in GEOS-5. The cubed-sphere atmospheric dynamics is summarized in Section 
3, while information on the specific model parameterizations are given in Section 4. The treatment of 
atmospheric aerosols is explained in Section 5, followed by a description of the relevant 
parameterizations used for representing CO and CO 2 carbon species in Section 6. Finally, Section 7 
describes the main boundary conditions and external datasets utilized in this simulation. 
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2. GEOS-5 Software Architecture 


2.1 The ESMF Foundation 

The Earth System Modeling Framework (ESMF) is a high-performance, flexible software 
infrastructure to increase ease of use, performance portability, interoperability, and reuse in climate, 
numerical weather prediction, data assimilation, and other Earth science applications. The ESMF 
project was initiated by NASA in 2002 and, over the last decade, it has become a widely used tool in 
the Earth Sciences. Models that have implemented ESMF interfaces include the current GEOS-5 
Earth System model, the Community Earth System Model (CESM), the Weather Research and 
Forecast model (WRF), NOAA's National Environmental Modeling System (NEMS), Naval Research 
Laboratory's Coupled Ocean-Atmosphere Mesoscale Prediction System (COAMPS), and the GFDL 
Modular Ocean Model (MOM4). 


The ESMF defines a component-based architecture 
for composing complex, coupled modeling systems 
and includes data structures and utilities for 
developing individual models. It includes high- 
performance software for representing and coupling 
model components, and a set of utilities for common 
modeling functions. ESMF is implemented as a 
collection of very general programming classes that 
can be used both to construct ESMF components and 
to connect them to one another. These classes thus 
support modelers in building interoperable and portable architecture and the user’s computational code, 
codes. This design is illustrated by the ESMF sandwich 

diagram (Figure 1), where the user's computational code sits between the two ESMF layers. In 
general, componentization with ESMF has been implemented at the level of major physical domains, 
where simulated interactions require inter-component data communications (e.g., atmosphere, ocean), 
and has been implemented as wrappers that minimally modify existing user code. 

2.2 MAPL: A Toolkit for building ESMF Compliant Applications 

As the ESMF became available, several groups were involved in prototyping its use in climate and 
weather prediction models and in data assimilation systems. Comparing the various implementations 
led to two seemingly contradictory conclusions: all implementations are different and much of what 
they do is the same. Both conclusions were anticipated, since ESMF is a general framework designed 
to meet a wide variety of needs. This generality is an important strength of the ESMF design, but it 
also implies that there are many different ways of using ESMF - even when performing very similar 
tasks. 

The Modeling Analysis and Prediction Layer (MAPL) software library (Suarez et al., 2014) arose as a 
response to this early experience, particularly during the construction of GEOS-5. MAPL is based on 
the observation that much of the work done in these initial implementations can be standardized; thus, 
reducing the labor of constructing ESMF applications, as well as increasing their interoperability. 
MAPL provides: 

• Specific conventions and best practices for the utilization of ESMF in climate models 

• A middle-ware software layer (between the model and ESMF) that facilitates the adoption of 


ESMF Superstructure 
AppDriver 

Component Classes: GridComp, CpIComp, State 


I 


r 


ESMF Infrastructure 
Data Classes: Bundle, Field, Grid, Array 
Utility Classes: Clock, Log Err, DELayout, VM, Config 
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Figure 1. The ESMF sandwich diagram, 
illustrating the relationship between the ESMF 
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ESMF by climate models. 

A MAPL -based component is a fully compliant ESMF component that accesses the MAPL library to 
build and execute. The parent code in an application that uses a MAPL -based component is not 
required to use the MAPL machinery for coupling. For example, the MAPL-based GOCART aerosol 
component is implemented in NOAA's National Environmental Modeling System (NEMS), but 
NEMS itself does not use any of the MAPL coupling capabilities. 

The NUOPC ( http://www.earthsystemmodeling.org/nuopc/NUOPC_refdoc/) layer provides similar 
functionality for coupling high-level components such as oceanic and atmospheric models. In 
contrast, MAPL is designed to hierarchically couple components at a much higher level of 
granularity, from physical parameterizations all the way to components that represent the full climate 
system. 

2.3 Overview of GEOS-5 Components 

Unlike other models that use the ESMF at the very high level to couple “model components” (ocean, 
atmosphere, etc.), GEOS-5 relies on the infrastructure provided by MAPL to couple atmospheric 
dynamics, radiation, moist processes, ocean and land surface, chemistry and aerosol processes, etc. 
Figure 2 illustrates the hierarchical organization of grid components in the version 1 of the GEOS-5 
AGCM used to perform the 7-km G5NR. 



Figure 2. Part of the hierarchical structure of the ESMF grid components in GEOS-5. This diagram was 
automatically generated from the GEOS-5 source code used for the Nature Run. 

The three main components are the atmospheric GCM (agcm), the oceanic GCM ( ogcm ), and the data 
assimilation related component ( mkiau ) which is not active in this simulation). Notice that the G5NR 
was performed with prescribed sea surface temperature and sea ice ( datasea and dataseaice). Among 
the several choices of dynamics, G5NR was run with the Finite-volume dynamics on the cubed- 
sphere grid ( FV dycoreCubed ). The breakdown of the physics component appears in Fig. 3, showing 
the parameterization of moist processes (moist), surface processes, radiation, turbulence, chemistry 
and gravity wave drag ( gwd ). While this system provides several chemistry options, G5NR was run 
with GOCART aerosols (the CO and CO 2 tracers are also embedded in the GOCART ESMF module) 
and the Parameterized Chemistry ( pchem ) component. The Chemistry component ( chem ) provides 
aerosols, ozone and other radiatively active gases for the radiation parameterization. Additional 
information on these components are given below. 


1 Specifically, this system is based on module Ganymed with CVS Tag wmp-Ganymed-4 _0 _BETA8 . 
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Figure 3. The hierarchical structure of the GEOS-5 physics component and its children in the ESMF framework. 
This diagram highlights the modular nature of the different physical process components in GEOS-5, including the 
adoption of the GOCART chemistry package, and other chemical modules (GAAS, CARMAchem, MAMchem, 
GMIchem, StratChem) that are options not invoked in the 7-km G5NR simulation. 

3. Atmospheric Dynamics 

The, finite -volume (FV) dynamics utilized within GEOS-5 evolved from a foundation of multi- 
dimensional flux-form schemes from Gudunov (1959) and Van Leer (1977), using higher order sub- 
grid schemes with the introduction of the “piecewise parabolic method” of Woodward and Colella 
(1981). These methods were generalized to multi-dimensional schemes for global atmospheric 
modeling within the FV dynamics (Lin and Rood, 1996). 

The flux-form scheme consists of a two-grid approach under the shallow water framework for 
consistent transport of mass and absolute vorticity. A split-explicit time-stepping approach is applied 
to produce a fully explicit shallow water scheme with sufficient computational efficiency (Lin and 
Rood, 1997). The pressure gradient term follows a finite- volume approach in terrain following 
coordinates (Lin 1997, 1998), using a Lagrangian control-volume vertical coordinate (Lin and Rood 
1998, 1999) simplifying the 3-dimensional scheme to a series of stacked 2-dimensional shallow water 
layers, periodically remapped to an Eulerian terrain-following coordinate with a mass, momentum, 
and total energy conserving algorithm (Lin, 2004). 

The original FV algorithm (Lin, 2004), designed for orthogonal coordinate systems, has been 
extended to operate in a general curvilinear coordinate system on the cubed-sphere grid (Putman and 
Lin, 2007, 2009), with explicit treatment of edge discontinuities at intersecting faces of the cubed- 
sphere grid. The cubed-sphere FV dynamics also includes an improved piecewise-parabolic method 
(PPM) for construction of sub-grid distributions in the advection scheme with the addition of Huynh’s 
2nd constraint (Huynh, 1996) for monotonicity. 
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One of the most unique aspects of the hydrostatic FV dynamics is its use of the terrain-following 
Lagrangian coordinate system (Lin, 2004). The improved accuracy in vertical transport characteristics 
relative to commonly used cores, such as the spectral core, has been documented by Rasch et al. 
(2006). At cloud-permitting resolution (1-15 km), the vertical velocity can become very large (-30 
m/s updrafts are not uncommon with -3-10 km resolution). The use of a Lagrangian coordinate 
system, as in the FV dynamics, removes this potentially severe time-step restriction without ad hoc 
vertical-velocity damping. 

The hydrostatic formulation of Lin (2004) has been extended to the fully compressible non- 
hydrostatic flow (essentially the unapproximated Euler equations on the sphere). To maintain the 
advantages of the “vertically Lagrangian discretization” of the hydrostatic system, an explicit sound 
wave solver based on the conservation of Riemann invariants was developed. Due to the vertical CFL 
condition imposed by the vertically propagating sound waves, traditional “Riemann solvers” available 
in the literature require the use of a prohibitively small time step (e.g., Carpenter et al., 1990). The 
Riemann solver we developed for the non-hydrostatic FV dynamics is consistently “Lagrangian,” in 
the sense that contributions from all the sound waves within the entire physical domain of dependence 
are considered, not just the two nearby volumes, as in typical finite-volume CFD (computational fluid 
dynamics) solvers. As a result, the time step is not severely limited by the vertically propagating 
sound waves: the model runs stably with sound-wave Courant numbers up to 100. 

The 7-km G5NR is executed with a heartbeat time step of 300 seconds. This heartbeat is the time step 
at which the physics and dynamics components are called during execution. The FV dynamics further 
sub-cycles this time step for CFL stability, executing at a small time step of five seconds while 
remapping the Lagrangian vertical coordinate back to the Eulerian terrain-following coordinates every 
75 seconds. A vertical sponge layer is included in the FV formulation to damp the top layers and 
prevent reflection of vertically propagating waves off of the model top from contaminating the 
simulation. This sponge layer is applied to the top two model levels with an increased amplitude of 
second order horizontal diffusion applied to these layers. 

4. Model Physics: Parameterizations 

The GEOS-5 AGCM physics includes parameterization schemes for atmospheric convection, large- 
scale precipitation and cloud cover, longwave and shortwave radiation, turbulence, gravity wave drag, 
a land surface model, and a simple glacier model. These physics parameterizations are scale aware, in 
that they dynamically adapt to the horizontal resolution of GEOS-5 (as described in the text). This 
multi-scale design allows GEOS-5 to easily move from climate simulations on the order of 50- to 
100-km resolutions, to medium range weather prediction and data assimilation resolutions of 25-km, 
to cloud permitting resolutions of 14- to 3.5-km (Putman and Suarez, 2011). 

Convection is parameterized using the Relaxed Arakawa-Schubert (RAS) scheme of Moorthi and 
Suarez (1992) and includes a scheme for the generation and re-evaporation of falling rain (Bacmeister 
et al., 2006). RAS is a mass flux scheme with an updraft-only detraining plume cloud model and a 
quasi-equilibrium closure. As resolution is increased, the convection parameterization is restrained 
using a stochastic, resolution-dependent limit on deep convection (Tokioka et al., 1988). As resolution 
increases approaching cloud resolving scales (15- to 1-km), the large-scale moist processes begin to 
explicitly resolve some of the deep convection. This stochastic limiting scheme essentially prevents 
deep convection and restricts RAS to act as a shallow convection scheme. For the 7-km G5NR, this 
limiting parameter (MAXD ALLOWED) is set to 450. Additionally, the RAS scheme is constrained 
by the choice of shallow and deep convective time-scales: the longer the time-scale, the more 
restrained RAS becomes. In the 7-km G5NR these time-scales (RASAL1 and RASAL2) are set to 
1800 and 43200 seconds for shallow and deep convection respectively. 

The prognostic cloud cover and cloud water and ice scheme is from Bacmeister et al. (2006), with the 
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total water probability distribution function (pdf) of Molod (2012). The critical relative humidity 
within the pdf increases as the grid cell area shrinks, for the 7-km G5NR this parameter 
(MINRHCRIT) is set to 0.98. The scheme includes large-scale condensation, evaporation, 
autoconversion and accretion of cloud water and ice, sedimentation of cloud ice and re-evaporation of 
falling precipitation. 

As in the GEOS-5 configuration described in Rienecker et al. (2008), the longwave radiative 
processes are described by Chou and Suarez (1994), and include absorption due to cloud water, water 
vapor, carbon dioxide, ozone, nitrous oxide and methane. The shortwave radiation transfer module is 
from Chou (1990) and Chou (1992), and includes absorption by water vapor, ozone, carbon dioxide, 
oxygen, cloud water, and aerosols and includes scattering by clouds water and aerosols. 

The turbulence parameterization is based on the Lock et al. (2000) scheme, acting together with the 
Richardson-number-based scheme of Louis and Geleyn (1982). The Lock scheme includes a 
representation of non-local mixing (driven by both surface fluxes and cloud-top processes) in unstable 
layers, either coupled to or decoupled from the surface. The original scheme was extended in GEOS-5 
to include moist heating and entrainment in the unstable surface parcel calculations. The Monin- 
Obukhov similarity theory based parameterization of surface layer turbulence is described in Helfand 
and Schubert (1995), and includes the effects of a viscous sublayer for heat and moisture transport 
over all surfaces except land. The ocean roughness is determined by a blend of the algorithms of 
Large and Pond (1981) and Kondo (1975), modified in the mid-range wind regime according to 
Garfinkel et al. (201 1) and in the high wind regime according to Molod et al. (2013). 

The gravity wave parameterization computes the momentum and heat deposition into the grid-scale 
flow due to orographic (McLarlane, 1987) and nonorographic (after Garcia and Boville, 1994) gravity 
wave breaking. Mountain waves are forced by the sub-grid orographic variability, the variance of the 
orography is scaled down with increasing resolution to account for the better resolved topographically 
induced gravity waves due to increased resolution in the dynamics. This variance is controlled by 
model parameter (ELLGWORO) which is set to 0.015625 in the 7-km GEOS-5 Nature Run. The 
smallest scales (< 10km) are not used to force gravity waves, but enter into an orographic form drag 
used in the turbulence module. 

The Land Surface Model from Koster et al. (2000) is a catchment-based scheme which treats subgrid- 
scale heterogeneity in surface moisture statistically. The applied subgrid-scale distributions are related 
to the topography, allowing it to exert a major control over much of the subgrid variability. Lor 
glaciated land, the surface is represented with a 15-layer ice column for the conduction of heat below 
the snow-ice interface (Cullather et al., 2014), while the overlying snow cover is allowed to be 
fractional. The catchment and glacier models are each coupled to the multi-layer snow model of 
Stieglitz et al. (2001). Southern Hemisphere sea ice albedo is prescribed to be 0.6, while Northern 
Hemisphere sea ice albedo varies on the annual cycle based on observed values (Duynkerke and de 
Roode, 2001). 

5. Atmospheric Aerosols 

A version of the Goddard Chemistry, Aerosol, Radiation, and Transport model (GOCART, Chin et 
al., 2002) is run online, with coupling to the GEOS-5 radiation code (Colarco et al., 2010). GOCART 
treats the sources, sinks, and chemistry of dust, sulfate, sea salt, and black and organic carbon 
aerosols. Aerosol species are assumed to be external mixtures. Total mass of sulfate and hydrophobic 
and hydrophilic modes of carbonaceous aerosols are tracked, while for dust and sea salt the particle 
size distribution is explicitly resolved across five non-interacting size bins for each. Both dust and 
sea-salt have wind-speed dependent emission functions, while sulfate and carbonaceous species have 
emissions principally from fossil fuel combustion, biomass burning, and biofuel consumption, with 
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additional biogenic sources of organic carbon. Sulfate has additional chemical production from 
oxidation of SO 2 and DMS, and we include a database of volcanic SO 2 emissions and injection 
heights. Details about the specific emission datasets are given in Section 7. 

For all aerosol species, optical properties are primarily from the commonly used OPAC data set 
(Hess et al. 1998). We have recently updated our dust optical properties data set to incorporate non- 
spherical dust properties based on Meng et al. (2010), which permits, for example, calculation of the 
aerosol depolarization ratio. Stratospheric aerosol perturbations and their chemical, radiative, and 
dynamical impacts in GEOS-5 have been studied in the context of volcanic eruptions (Aquila et al. 
2012, 2013, 2014), geo-engineering (Pitari et al., 2014), and meteor inputs (Gorkavyi et al., 2013). 
The GEOS-5/GOCART system has also been used to investigate the impact of aerosols on tropical 
cyclones (Reale et al., 2011, 2014), aerosol semi-direct effects (Randles et al., 2013), Indian Monsoon 
(Kishcha et al., 2014, Yi et al., 2014), Saharan dust transport (Wong at al., 2008, Nowottnick et al., 
2011, Colarco et al. 2013), and the aerosol impact on snow albedo (Yasunari et al., 201 1,2014) 

6. Carbon Species 

In addition to aerosol species, GEOS-5 simulates the emission, uptake and transport of carbon 
monoxide (CO) and carbon dioxide (CO 2 ). A simplified version of CO chemistry, described in Ott et 
al. (2010), is used to increase computational efficiency. CO is emitted from biomass burning (Section 
7.2), fossil and bio-fuel (section 7.2) combustion, and produced chemically from biogenic 
hydrocarbon and methane oxidation. CO emissions from fossil fuels, biofuels, and biomass burning 
are increased by 20%, 19%, and 11%, respectively, in order to account for CO production from non- 
methane hydrocarbons emitted from these sources. Biogenic isoprene and monoterpene emissions 
were calculated by the Global Modeling Initiative (GMI: Rotman et al., 2001) combined troposphere- 
stratosphere chemical transport model (CTM) using the method of Guenther et al. (1995) and are 
released directly as CO after applying an estimated yield during the oxidation of these species 
following Duncan et al. (2007). Monthly mean methane fields are used to calculate CO produced by 
methane oxidation as described in Bian et al. (2007). In order to calculate CO loss through reaction 
with OH, monthly mean OH fields produced by the GMI CTM for the year 2006 are used along with 
a prescribed loss frequency for CO. 

CO 2 is also emitted by fossil fuel combustion and biomass burning. Natural fluxes of CO 2 between 
the atmosphere and land and ocean carbon reservoirs, calculated as part of NASA’s Carbon 
Monitoring System (CMS) Flux Pilot Project (Ott et al., 2014, in review), are also included. Monthly 
net primary production (NEP) and ecosystem respiration were computed using the Carnegie- Ames- 
Stanford- Approach - Global Fire Emissions Database version 3 (CASA-GFED3) model (Randerson 
et al., 2013) and disaggregated to daily time intervals following Olsen and Randerson (2004). NEP 
fluxes, initially provided at 0.5-degree spatial resolution, were downscaled to 0.1 degree for the 
Nature Run by assuming that GPP is proportional to MODIS Enhanced Vegetation Index (EVI; Huete 
et al., 2002), a measure of vegetation greenness. Ecosystem respiration is downscaled by first 
assuming that, if vegetation is present in a 0.5 degree grid cell, heterotrophic and autotrophic 
contributions are of equal magnitudes. Autotrophic respiration is then assumed to be proportional to 
EVI while heterotrophic respiration is distributed uniformly across all land grid cells within the 0.5° 
grid box. In cells where no vegetation is present, all respiration is distributed uniformly across all land 
grid cells. Following the spatial downscaling, daily NPP and respiration are downscaled to three-hour 
periods following Olsen and Randerson (2004) to ensure a realistic diurnal cycle in atmospheric CO 2 . 
Ocean CO 2 fluxes are calculated online in GEOS-5 following Wanninkhof (1992) using as input daily 
ocean surface partial pressure of CO 2 and salinity from NASA’s Ocean Biogeochemical Model 
(NOBM; Gregg et al., 2003); sea surface temperatures (Reynolds et al., 2007); and GEOS-5 
atmospheric CO 2 mixing ratios and 10-meter wind speeds. 
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7. Boundary Conditions and External Datasets 


7.1 Sea-surface Temperature and Sea-Ice 

Sea surface temperature and sea- ice are derived at 14- degree from a combined Reynolds (Reynolds., 
2007) OSTIA (Donlon et al., 201 1) blended product. These daily SSTs are interpolated to the 7-km 
cubed-sphere grid and the data ocean component of GEOS-5 is called on the 30-minute radiation 
time- step. 

7.2 Land surface boundary conditions 

Mosaic land cover classes on the 7-km cubed-sphere grid (cl440) catchment-tiles were derived using 
1km SiB2 land classification data from the USGS Global Land Cover Characteristics Data Base 
Version 2.0. Soil texture classes were derived using global data from the 5 arc-minute NGDC 
(National Geophysical Data Center) soil data (Reynolds et al., 1999). For each texture class, 
corresponding Cosby et al. soil hydraulic parameters were obtained from the second Global Soil 
Wetness Project (GSWP-2). Vegetation Parameters were derived from 1-degree GSWP-2 Leaf Area 
Index (LAI) and Greenness Fraction data were interpolated to cl440 catchment- tiles. 8-day 
Climatological cycles of diffused visible and diffused near-infrared albedo were computed using 30 
arc-second MODIS (v5) data from the period 2001-2011. We scale model albedo to match the 8-day 
MODIS albedo climatology. Hydrologic catchment delineation data from HYDRO- lk were used to 
define cl440 catchment-tiles. The statistics of compound topography on each hydrologic catchment 
were also computed using HYDRO lk data. 

7.3 Biomass Burning Emissions 

Emissions of organic carbon (OC), black 
carbon (BC), sulfur dioxide (SO 2 ), carbon 
monoxide (CO) and carbon dioxide (C02) 
from biomass burning are obtained from 
the Quick Fire Emissions Dataset (QFED) 
version 2.4-r6. The QFED is based on the 
fire radiative power (top-down) approach 
and draws on the cloud correction method 
used in the Global Fire Assimilation 
System (GFAS, Kaiser et al. (2012)) but in 
addition it employs a more sophisticated 
treatment of emissions from non-observed 
land areas (Darmenov and da Silva, 2014). 

Location and fire radiative power of fires 
are obtained from the Moderate Resolution 
Imaging Spectroradiometer (MODIS) Level 2 fire products and the MODIS Geolocation products. 
Data from the Level 2 fire products are gridded at 0. 1x0.1 degrees horizontal resolution and combined 
to create daily mean emissions at the same resolution. A diurnal cycle is imposed online on the daily 
mean emission values that is more prominent in the tropics and gradually weakens at higher latitudes 
in the North hemisphere’s temperate zone. Seasonal mean biomass burning emissions of black carbon 
from the 0.5x0. 5 degrees model output are shown in Figure 4. Spatial and temporal patterns of OC, 
SO 2 , CO and CO 2 emissions are similar to the patterns of the BC emissions and are not shown here. 
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Figure 4. Seasonal mean QFED emissions of black carbon (BC (ig 
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7.4 Anthropogenic Emissions 

Anthropogenic emissions of carbon species and aerosols are largely taken from the Emissions 
Database for Global Atmospheric Research (EDGAR; Olivier et al., 1994; EDGAR Project Team, 
2009), which are provided annually at 0.1-degree resolution. For CO and CO 2 , EDGAR v4.2 
emissions from 2005 through 2007 were used in 7-km G5NR. CO emissions are temporally 
disaggregated from yearly to monthly using information on the seasonal cycle of fossil fuel emissions 
from Bey et al. (2001) while CO 2 fossil fuel seasonal cycles are imposed based on estimates from the 
Carbon Dioxide Information Analysis Center (CDIAC; Boden et al., 2013). 

For organic and black carbon aerosol species, which are not included in EDGAR v4.2, EDGAR- 
HTAP emissions were used in the Nature Run. To ensure consistency with previous GEOS-5 
simulations, which have used emissions from the Aerosol Comparisons between Observations and 
Models (AeroCom; Myhre et al., 2013) Phase II project, emission totals over 1-degree areas are 
adjusted to match yearly AeroCom emissions estimates for 2004 through 2008. Because EDGAR 
aerosol emissions are currently only available through 2005, the 2005 spatial distribution of EDGAR 
emissions are used to estimate Nature Run emissions in all years. The resulting anthropogenic 
emissions can be thought of as a hybrid between the EDGAR and AeroCom emissions datasets used, 
drawing information on total emissions and interannual variability from AeroCom while using 
EDGAR to achieve a high spatial resolution. SO 2 and SO 4 emissions from ships are handled 
similarly, by combining EDGAR and AeroCom emissions. Non-shipping emissions of SO 2 are taken 
directly from EDGAR v4.1 estimates for 2005 because of errors in AeroCom emissions discussed in 
Diehl et al. (2012). 
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7.5 Oxidant Fields 

The GOCART tropospheric sulfate chemistry 
mechanism includes gas-phase and aqueous-phase 
reactions that account for the major chemical 
production and loss pathways of SO 2 and SO 4 . 
Concentrations of hydroxyl radical (OH), nitrate 
radical (NO 3 ) and hydrogen peroxide (H 2 O 2 ) are 
prescribed using monthly mean constituent fields 
from the NASA Global Modeling Initiative (GMI), 
(Duncan et al., 2007; Strahan et al., 2007) GMI- 
MERRA simulation. The GMI-MERRA simulation 
(Strahan et al., 2013) was performed with the GMI 
chemical transport model, driven by assimilated 
meteorological data from MERRA, using an online 

parameterization of lightning NO x emissions, 
specified biogenic VOC emissions, and biomass 
burning emissions from the GFED-v2 inventory 
(van der Werf et al., 2006). Globally anthropogenic 
emissions were obtained from the EDGAR 
inventory. Over the Unites states, Europe and Asia 
the EDGAR emissions were augmented with 
emissions from the EPA NEI99, EMEP and the 
Streets inventory for 2006 (Zhang et al., 2009), 
respectively. Diurnal variations of OH 
concentrations are computed by scaling the 
monthly mean fields to the cosine of the solar 
zenith angle. Diurnal variations of NO 3 are imposed 
by assuming zero concentrations in daylight and 
distributing the monthly mean values over night- 
time only. Because H 2 O 2 is the limiting agent of the 
aqueous phase formation of SO 4 it is periodically 
reset every three hours to the monthly varying 
values. Prescribed monthly mean values of 
methane (CH4) and OH, from a GMI simulation, 
are also used to calculate the chemical production 
and loss of CO. 
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Figure 7. Zonal mean concentration (10 9 x molecules cm 
of hydrogen peroxide (H2O2). 
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Figure 5. Zonal mean concentration (10 6 x molecules 
cm' 3 ) of hydroxyl radical (OH). 
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Figure 6. Zonal mean concentration (l(r x molecules 
cm' 3 ) of nitrate radical (NO3). 
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Web Resources 


GMAO web site: http : //gmao . gsfc .nasa.gov/ 

NetCDF information: http://www.unidata.ucar.edu/software/netcdf/ 
CF Standard Description: http://cf-pcmdi.lliil.gov/ 

The HDF Group: http://www.hdfgroup.org / 

Acronyms 


ADAS 

AOT 

CF 

CLSM 

COARDS 

DMS 

ECS 

EOS 

ESDT 

ESMF 

FP 

GES DISC 

GMAO 

GRIB 

GSI 

HDF 

IAU 

JCSDA 

MSA 

NCEP 

NetCDF 

PAR 

TOA 

TOMS 

UTC 

atmospheric data assimilation system 
aerosol optical thickness 
Climate and Forecast metadata convention 
Catchment Fand Surface Model 

Cooperative Ocean/Atmosphere Research Data Service metadata convention 

dimethylsulphide 

EOS Core System 

Earth Observing System 

Earth Science Data Type 

Earth System Modeling Framework 

Forward-processing 

Goddard Earth Sciences Data and Information Services Center 
Global Modeling and Assimilation Office 
GRIdded Binary 

Gridpoint Statistical Interpolation 

Hierarchical Data Format 

Incremental Analysis Update 

Joint Center for Satellite Data Assimilation 

methane sulphonic acid 

National Center for Environmental Prediction 

Network Common Data Form 

photosynthetically active radiation 

top of atmosphere 

Total Ozone Mapping Spectrometer 

Universal Time, Coordinated 
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Appendix A: Vertical Structure 

A.l Hybrid Sigma-Pressure Levels 


Products on the native vertical grid will be output on the following levels. Pressures are nominal for a 
1000 hPa surface pressure and refer to the top edge of the layer. Note that the bottom layer has a 
nominal thickness of 15 hPa. 


Lev 

P(hPa) 

Lev 

P(hPa) 

Lev 

P(hPa) 

Lev 

P(hPa) 

Lev 

P(hPa) 

Lev 

P(hPa) 

1 

0.0100 

13 

0.6168 

25 

9.2929 

37 

78.5123 

49 

450.000 

61 

820.000 

2 

0.0200 

14 

0.7951 

26 

11.2769 

38 

92.3657 

50 

487.500 

62 

835.000 
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0.0327 

15 

1.0194 

27 

13.6434 

39 

108.663 

51 
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0.0476 

16 

1.3005 

28 

16.4571 

40 

127.837 
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562.500 

64 

865.000 
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0.0660 

17 

1.6508 

29 

19.7916 

41 

150.393 

53 

600.000 

65 

880.000 
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0.0893 

18 

2.0850 

30 

23.7304 

42 

176.930 

54 

637.500 

66 

895.000 
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0.1197 

19 

2.6202 

31 

28.3678 

43 

208.152 

55 

675.000 

67 

910.000 

8 

0.1595 

20 

3.2764 

32 

33.8100 

44 

244.875 

56 

700.000 

68 

925.000 
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0.2113 

21 

4.0766 

33 

40.1754 

45 

288.083 

57 

725.000 

69 

940.000 

10 

0.2785 

22 

5.0468 

34 

47.6439 
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337.500 

58 

750.000 

70 

955.000 

11 

0.3650 

23 

6.2168 

35 

56.3879 

47 

375.000 

59 

775.000 

71 

970.000 

12 

0.4758 

24 

7.6198 

36 

66.6034 
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412.500 

60 

800.000 

72 

985.000 
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Appendix B: Surface Representation 


In GEOS-5 the surface below each atmospheric column consists of a set of tiles that represent various 
surface types. Tiles can be of four different types: Ocean, Land, Ice, and Lake, as illustrated in Ligure 
8. In each grid box a single Ice tile represents those areas covered by permanent ice. Similarly a single 
Lake tile represents continental areas covered permanently by water. Other continental areas (non 
Lake or Ice) can be further subdivided into tiles that represent parts of the grid box in different 
hydrological catchments, defined according to the Pfafstetter (1989) system. Each of these is, in turn, 
divided into subtiles (not shown in figure 8) that represent the wilted, unsaturated, saturated, and 
snow-covered fractions of the tile. These fractions vary with time and are predicted by the model 
based on the hydrological state of the catchment and its fine-scale topographic statistics. Details of the 
land model, including the partitioning into sub tiles, can be found in Koster et al. (2000). The Ocean 
tile can be divided into two subtiles that represent the ice-covered and ice-free parts of the ocean part 
of the atmospheric grid box. The fractional cover of these subtiles also varies with time. 



Figure 8. Schematic of the Ocean, Land, Ice and Lake tiles used in the land-surface model, as described in 
Appendix B. 
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